AITopics | computational node

Collaborating Authors

computational node

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A GPU-Accelerated Bi-linear ADMM Algorithm for Distributed Sparse Machine Learning

Olama, Alireza, Lundell, Andreas, Kronqvist, Jan, Ahmadi, Elham, Camponogara, Eduardo

arXiv.org Artificial IntelligenceJun-26-2024

This paper introduces the Bi-linear consensus Alternating Direction Method of Multipliers (Bi-cADMM), aimed at solving large-scale regularized Sparse Machine Learning (SML) problems defined over a network of computational nodes. Mathematically, these are stated as minimization problems with convex local loss functions over a global decision vector, subject to an explicit $\ell_0$ norm constraint to enforce the desired sparsity. The considered SML problem generalizes different sparse regression and classification models, such as sparse linear and logistic regression, sparse softmax regression, and sparse support vector machines. Bi-cADMM leverages a bi-linear consensus reformulation of the original non-convex SML problem and a hierarchical decomposition strategy that divides the problem into smaller sub-problems amenable to parallel computing. In Bi-cADMM, this decomposition strategy is based on a two-phase approach. Initially, it performs a sample decomposition of the data and distributes local datasets across computational nodes. Subsequently, a delayed feature decomposition of the data is conducted on Graphics Processing Units (GPUs) available to each node. This methodology allows Bi-cADMM to undertake computationally intensive data-centric computations on GPUs, while CPUs handle more cost-effective computations. The proposed algorithm is implemented within an open-source Python package called Parallel Sparse Fitting Toolbox (PsFiT), which is publicly available. Finally, computational experiments demonstrate the efficiency and scalability of our algorithm through numerical benchmarks across various SML problems featuring distributed datasets.

algorithm, constraint, node, (13 more...)

arXiv.org Artificial Intelligence

2405.16267

Country:

Europe > Finland > Ostrobothnia > Vaasa (0.05)
South America > Brazil > Santa Catarina > Florianópolis (0.04)
Europe > Sweden > Stockholm > Stockholm (0.04)
Asia > China (0.04)

Genre: Research Report > New Finding (0.48)

Industry: Information Technology (0.68)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Support Vector Machines (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Add feedback

Improving the Performance of Echo State Networks Through Feedback

Ehlers, Peter J., Nurdin, Hendra I., Soh, Daniel

arXiv.org Machine LearningDec-22-2023

Reservoir computing, using nonlinear dynamical systems, offers a cost-effective alternative to neural networks for complex tasks involving processing of sequential data, time series modeling, and system identification. Echo state networks (ESNs), a type of reservoir computer, mirror neural networks but simplify training. They apply fixed, random linear transformations to the internal state, followed by nonlinear changes. This process, guided by input signals and linear regression, adapts the system to match target characteristics, reducing computational demands. A potential drawback of ESNs is that the fixed reservoir may not offer the complexity needed for specific problems. While directly altering (training) the internal ESN would reintroduce the computational burden, an indirect modification can be achieved by redirecting some output as input. This feedback can influence the internal reservoir state, yielding ESNs with enhanced complexity suitable for broader challenges. In this paper, we demonstrate that by feeding some component of the reservoir state back into the network through the input, we can drastically improve upon the performance of a given ESN. We rigorously prove that, for any given ESN, feedback will almost always improve the accuracy of the output. For a set of three tasks, each representing different problem classes, we find that with feedback the average error measures are reduced by $30\%-60\%$. Remarkably, feedback provides at least an equivalent performance boost to doubling the initial number of computational nodes, a computationally expensive and technologically challenging alternative. These results demonstrate the broad applicability and substantial usefulness of this feedback scheme.

artificial intelligence, esn, machine learning, (15 more...)

arXiv.org Machine Learning

2312.15141

Country:

North America > United States > Arizona (0.04)
Europe > Sweden > Uppsala County > Uppsala (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(4 more...)

Genre: Research Report > New Finding (0.66)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

A Novel Evolutionary Algorithm for Hierarchical Neural Architecture Search

Christoforidis, Aristeidis, Kyriakides, George, Margaritis, Konstantinos

arXiv.org Artificial IntelligenceMay-4-2023

In this work, we propose a novel evolutionary algorithm for neural architecture search, applicable to global search spaces. The algorithm's architectural representation organizes the topology in multiple hierarchical modules, while the design process exploits this representation, in order to explore the search space. We also employ a curation system, which promotes the utilization of well performing sub-structures to subsequent generations. We apply our method to Fashion-MNIST and NAS-Bench101, achieving accuracies of $93.2\%$ and $94.8\%$ respectively in a relatively small number of generations.

artificial intelligence, evolutionary algorithm, machine learning, (16 more...)

arXiv.org Artificial Intelligence

2107.08484

Country:

Europe > North Macedonia (0.05)
Europe > Greece > Central Macedonia > Thessaloniki (0.05)

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.92)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.86)

Add feedback

StarNet: Gradient-free Training of Deep Generative Models using Determined System of Linear Equations

Zadeh, Amir, Benoit, Santiago, Morency, Louis-Philippe

arXiv.org Machine LearningJan-3-2021

In this paper we present an approach for training deep generative models solely based on solving determined systems of linear equations. A network that uses this approach, called a StarNet, has the following desirable properties: 1) training requires no gradient as solution to the system of linear equations is not stochastic, 2) is highly scalable when solving the system of linear equations w.r.t the latent codes, and similarly for the parameters of the model, and 3) it gives desirable least-square bounds for the estimation of latent codes and network parameters within each layer.

equation, linear equation, starnet, (17 more...)

arXiv.org Machine Learning

2101.00574

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.14)
South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.60)

Add feedback

Distributed Answer Set Coloring: Stable Models Computation via Graph Coloring

De Bortoli, Marco

arXiv.org Artificial IntelligenceSep-18-2019

Answer Set Programming (ASP) is a famous logic language for knowledge representation, which has been really successful in the last years, as witnessed by the great interest into the development of efficient solvers for ASP. Yet, the great request of resources for certain types of problems, as the planning ones, still constitutes a big limitation for problem solving. Particularly, in the case the program is grounded before the resolving phase, an exponential blow up of the grounding can generate a huge ground file, infeasible for single machines with limited resources, thus preventing even the discovering of a single non-optimal solution. To address this problem, in this paper we present a distributed approach to ASP solving, exploiting distributed computation benefits in order to overcome the just explained limitations. The here presented tool, which is called Distributed Answer Set Coloring (DASC), is a pure solver based on the well-known Graph Coloring algorithm. DASC is part of a bigger project aiming to bring logic programming into a distributed system, started in 2017 by Federico Igne with mASPreduce and continued in 2018 by Pietro Totis with a distributed grounder. In this paper we present a low level implementation of the Graph Coloring algorithm, via the Boost and MPI libraries for C++. Finally, we provide a few results of the very first working version of our tool, at the moment without any strong optimization or heuristic.

artificial intelligence, logic & formal reasoning, node, (14 more...)

arXiv.org Artificial Intelligence

doi: 10.4204/EPTCS.306.60

1909.08263

Country:

North America > United States (0.93)
Europe (0.93)

Genre: Research Report (0.50)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Logic & Formal Reasoning (1.00)

Add feedback

Learning Nonlinear Input-Output Maps with Dissipative Quantum Systems

Chen, Jiayin, Nurdin, Hendra I.

arXiv.org Artificial IntelligenceMay-16-2019

In this paper, we develop a theory of learning nonlinear input-output maps with fading memory by dissipative quantum systems, as a quantum counterpart of the theory of approximating such maps using classical dynamical systems. The theory identifies the properties required for a class of dissipative quantum systems to be {\em universal}, in that any input-output map with fading memory can be approximated arbitrarily closely by an element of this class. We then introduce an example class of dissipative quantum systems that is provably universal. Numerical experiments illustrate that with a small number of qubits, this class can achieve comparable performance to classical learning schemes with a large number of tunable parameters. Further numerical analysis suggests that the exponentially increasing Hilbert space presents a potential resource for dissipative quantum systems to surpass classical learning schemes for input-output maps.

artificial intelligence, computational node, machine learning, (18 more...)

arXiv.org Artificial Intelligence

doi: 10.1007/s11128-019-2311-9

1901.01653

Country:

North America > United States > New York (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.84)

Add feedback

Exploiting the Dynamics of Soft Materials for Machine Learning

#artificialintelligenceMay-3-2018, 14:11:53 GMT

Soft materials have been attracting attention because they add unprecedented functionality to machines and devices. This functionality enables soft materials to be used in a vast array of applications, such as grasping objects,1,2 human–robot interactions,3 medical and surgical tools,4 and prosthetics and wearables.5 The inherent softness of such materials results in increased adaptivity and decreased damage to other surfaces during contact.6,7 In addition, robots made with soft materials are able to generate complex behaviors with simpler actuations by partially outsourcing control to the morphological and material properties,8 which enhances the active coupling between control, body, and environment.9,10 Compared with rigid materials, soft materials exhibit rich dynamics including a variety of properties, such as nonlinearity, elasticity, and high dimensionality. In this article, we demonstrate that these dynamic properties constitute an asset that can be effectively employed for machine learning purposes. Our approach is based on a technique called reservoir computing,11–13 which is a framework rooted in recurrent neural network learning. When a high-dimensional dynamical system, which is referred to as the reservoir, is driven with input streams, it generates transient dynamics that operate as a type of temporal and finite kernel that facilitates the separation of the input states. If the dynamics involve short-term memory and nonlinear processing of the input stream, then nonlinear dynamical systems can be learned by adjusting a linear, static readout from the high-dimensional state space of the reservoir. We exploit the rich physical dynamics of soft materials directly as a reservoir for temporal machine learning problems.

artificial intelligence, machine learning, sensory time sery, (14 more...)

#artificialintelligence

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.55)

Add feedback

DID: Distributed Incremental Block Coordinate Descent for Nonnegative Matrix Factorization

Gao, Tianxiang, Chu, Chris

arXiv.org Machine LearningFeb-24-2018

Nonnegative matrix factorization (NMF) has attracted much attention in the last decade as a dimension reduction method in many applications. Due to the explosion in the size of data, naturally the samples are collected and stored distributively in local computational nodes. Thus, there is a growing need to develop algorithms in a distributed memory architecture. We propose a novel distributed algorithm, called \textit{distributed incremental block coordinate descent} (DID), to solve the problem. By adapting the block coordinate descent framework, closed-form update rules are obtained in DID. Moreover, DID performs updates incrementally based on the most recently updated residual matrix. As a result, only one communication step per iteration is required. The correctness, efficiency, and scalability of the proposed algorithm are verified in a series of numerical experiments.

algorithm, artificial intelligence, machine learning, (14 more...)

arXiv.org Machine Learning

1802.08938

Country: North America > United States (0.28)

Genre: Research Report (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.66)

Add feedback

DID: Distributed Incremental Block Coordinate Descent for Nonnegative Matrix Factorization

Gao, Tianxiang (Iowa State University) | Chu, Chris (Iowa State University)

AAAI ConferencesFeb-8-2018

Nonnegative matrix factorization (NMF) has attracted much attention in the last decade as a dimension reduction method in many applications. Due to the explosion in the size of data, naturally the samples are collected and stored distributively in local computational nodes. Thus, there is a growing need to develop algorithms in a distributed memory architecture. We propose a novel distributed algorithm, called distributed incremental block coordinate descent (DID), to solve the problem. By adapting the block coordinate descent framework, closed-form update rules are obtained in DID. Moreover, DID performs updates incrementally based on the most recently updated residual matrix. As a result, only one communication step per iteration is required. The correctness, efficiency, and scalability of the proposed algorithm are verified in a series of numerical experiments.

algorithm, artificial intelligence, machine learning, (17 more...)

AAAI Conferences

Thirty-Second AAAI Conference on Artificial Intelligence

Country: North America > United States (0.46)

Genre: Research Report (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.48)

Add feedback

Lessons learned from building a Hello World Neural Network - Blendo

#artificialintelligenceJul-22-2017, 14:00:31 GMT

My personal experience with Neural Networks began some time ago. Reading about the amazing things a neural network could do made me eager to explore this problem-solving approach that has attracted so much attention during the past few years. I remember myself impressed by a model that generates natural language descriptions of images and their regions, developed at the Stanford University in 2015, thinking that I would like to be able to do similar things at some point. From my experience in other machine learning related topics, very detailed mathematical explanations, full of derivatives and equations make understanding difficult. So, I decided to ignore them for the time being.

artificial intelligence, machine learning, neural network, (17 more...)

#artificialintelligence

Industry: Education (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback